AIbase

# Real-time Image Captioning

Moondream 2b 2025 04 14 4bit
Apache-2.0
Moondream is a lightweight vision-language model designed for efficient cross-platform deployment. The 4-bit quantized version, released on April 14, 2025, significantly reduces memory usage while maintaining high accuracy.
Image-to-Text Safetensors
moondream
6,037
38
Clip Gpt2 Finetuned
This is a fine-tuned version of CLIP-GPT2 for real-time image captioning tasks, designed to assist visually impaired individuals in understanding image content.
Image-to-Text Transformers
vidi-deshp
18
0
Moondream2 Llamafile
Apache-2.0
moondream2 is a compact vision-language model specifically designed for efficient operation on edge devices, offering convenient deployment through the llamafile format.
Image-to-Text
cjpais
310
30
© 2025 AIbase